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Abstract 

Background: Imaging Mass Spectrometry (IMS) provides a means to measure the spatial distribution of biochemical 
features on the surface of a sectioned tissue sample. IMS datasets are typically huge and visualisation and subsequent 
analysis can be challenging. Principal component analysis (PCA) is one popular data reduction technique that has 
been used and we propose another; the minimum noise fraction (MNF) transform which is popular in remote sensing. 

Findings: The MNF transform is able to extract spatially coherent information from IMS data. The MNF transform is 
implemented through an R-package which is available together with example data from http://staff.scm.uws.edu. 
a u/~g I en n/#Softwa re. 

Conclusions: In our example, the MNF transform was able to find additional images of interest. The extracted 
information forms a useful basis for subsequent analyses. 
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Background 

Imaging Mass Spectrometry (IMS) provides a means to 
measure the spatial distribution of drug metabolite, lipid, 
peptide and protein features on the surface of a sec- 
tioned tissue sample (see [1] and references therein). 
Typically, IMS methods utilise freshly frozen sections of 
tissue mounted onto conductive slides. These are coated 
with matrix followed by MALDI-ToF/ToF spectra acquisi- 
tion at anywhere from hundreds to thousands of positions 
across a tissue, the spatial locations of which are anno- 
tated. For example, a section of coronal murine midbrain 
can generate more than ~ 2000 spectra. Data acquisition 
at 0.1 GS/s over an m/z range 1000-26000 yields individ- 
ual mass spectra with more than 11,000 plotted points. 
The resulting data set is enormous and thus difficult to 
process, visualise and analyse effectively. 

The data can be thought of in two ways, firstly a set of 
mass spectra acquired at a spatial array of spots, and sec- 
ondly as a stack of ion intensity maps, each map being 
akin to a low resolution image. Software such as Biomap 
and flexlmaging (Bruker Daltonics) view IMS data as ion 
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intensity maps and include features such as data normali- 
sation and noise spectra exclusion (see Figure 1). However 
the choice of ion intensity maps to view is largely user 
driven and images are noisy. Further data analysis using 
external software packages is possible, for example, (Clin- 
ProTools for principal component analysis (PCA), hierar- 
chical clustering (HC) of spectra, or spectral model gen- 
eration [2-5]. Other analysis techniques used on IMS data 
include kriging of ion intensity maps [6] and supervised 
classification methods, for example, random forests [7]. 

Current methods typically use spectral features, not 
spatial information, to guide analysis. Hence the predom- 
inance of PCA and HC type approaches. We propose the 
use of the minimum noise fraction (MNF) transform [8] 
to, firstly, determine the most interesting spatial represen- 
tations of IMS data, and secondly, form the basis of data 
reduction for subsequent analysis. The MNF transform 
has previously been used on hyper-spectral images of tis- 
sue samples [9] but this is the first use of such a technique 
on IMS data. 

Findings 

Principal Components Analysis 

Principal Components Analysis (PCA) treats the IMS data 
as a collection of spectra. Therefore, in PCA, the spatial 
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Figure 1 Image (A) of coronal murine midbrain section, and ion intensity maps at m/z of 671 9 (B), 9980 (C) and 1 41 36 (D). m/z are 

approximate, from flexlmaging V2.1 . Scale bar is 1 mm. 



structure of the spots is not relevant and so the data can 
be represented as a matrix Z = {Z^} where i = l,...,n 
ranges over the spots on the tissue and k = l,...,p ranges 
over the mass charge ratios in the mass spectrum. Let 
Z = {Zfc} be a typical mass spectrum. PCA seeks linear 
combinations of intensities over the mass charge ratios 
that maximizes variance. That is, the first principal com- 
ponent is defined by a vector a = {a^} with a chosen 
so that Var(VZ) is maximised. Second and subsequent 
principal components maximize variance subject to being 
uncorrelated with all previous principal components. 

If Ez is the covariance matrix of the mass spectra, that 
is, the {k\J<2) entry is the covariance of the ion inten- 
sity measured at the /q-th m/z ratio and the ion intensity 
measured at the /c2-th m/z ratio, then the first principal 
component maximises a t T^za subject to a suitable scale 
constraint such as eta = 1. Generally, Ez is unknown so is 
estimated using the sample covariance matrix Sz given by 



(Sz)kik 2 



— ~ ^2 ( Z *i ~ Z 'h) i Z ik 2 ~ Z -k 2 ) 



where 



z k= Zik 



It should be noted that the mass spectra are unlikely 
to form a set of independent observations since spatially 
close spectra will likely be correlated. 

The MNF transform 

PCA makes no use of the spatial structure of the observed 
mass spectra. The Minimum Noise Fraction transform 
uses a simple model to allow the spatial structure to 
influence the analysis. Here we modify the notation to 
emphasise the spatial aspect; let Z(x) be the mass spec- 
trum at spatial location x. In our case, x will be a spot on 



the tissue section indexed by a horizontal and a vertical 
coordinate. A possible model for Z(x) is 

Z(x) =M(x) +N(x) 

where M(x) represents the signal at x and N(x) is the noise 
at x. 

This is to be interpreted as "the mass spectrum at spot x 
is composed of a spatial signal mass spectrum plus a noise 
mass spectrum". We assume the signal and noise compo- 
nents to be independent, and the noise component to have 
low spatial covariance. The signal component would likely 
have high spatial covariance. Both components would still 
have a covariance between intensities at differing mass 
charge ratios, represented by covariance matrices E^i 
and Ejy. 

The MNF transform seeks linear combinations of inten- 
sities over the mass charge ratios that maximizes signal to 
noise ratio (SNR). That is, the first MNF band is defined 
by a vector a — {a^} with a chosen so that SNR = 
Var(^M)/Var(^AT) is maximised. Replacing the vari- 
ances by expressions in terms of the covariance matrices 
we see that; 



SNR: 



a^Ma 
a^Hjqa 



In PCA we need an estimate for Sz, whereas for the 
MNF we need estimates of and X^. These are not as 
straight-forward to obtain as in PCA, since the signal M 
and noise N components are not directly observed. How- 
ever, by noting that (by independence) Ez = + £w we 
see that the SNR is maximised when the following ratio is 
maximised, 

a^za 
a t Hjqa 

Thus only an estimate for E^v is required. In reality, only 
an estimate of E^v or is required, and we find it easiest 
to estimate the former. 
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Green et al. [8] propose a shift difference method to esti- 
mate E/v and Berman et al [9] propose using the covari- 
ance of residuals from a local quadratic fit. In the latter 
case, a quadratic function is fit to a 3 x 3 neighbourhood of 
each spot for each mass charge ratio, and a residual com- 
puted at the spot. This produces a set of pseudo-residual 
data and the sample covariance of this used as the esti- 
mate. These original applications of the MNF transform 
are based on hyperspectral images where the spots are 
very close together. Here the spots from which MS spec- 
tra are collected are somewhat separated. For this reason 
we have used a simpler local linear fit, based on the ideas 
of [10], which is similar to using a symmetrical set of 
shift differences. Using the simpler approach places less 
reliance on spots that are further apart. Although there is 
scope to investigate other approaches, preliminary work 
shows little difference when a quadratic signal fit is used 
in this case. 

Each spot (except edge spots) has two horizontal and 
two vertical neighbours. Averaging these four values gives 
the prediction of a local linear fit at the central spot, 
from which a pseudo-residual can be derived. Since the 
spots are on a regular grid, this corresponds to the resid- 
ual from a local linear fit to the four neighbouring spots. 
This procedure produces a set of pseudo-residuals (one 
for each spot at each mass charge ratio, subject to simple 
modification at edge spots) from which the sample noise 
covariance matrix Sm can be formed. We use this as the 
estimate of Ejy. 

Implementation 

PCA corresponds to the maximisation of cfiSza subject 
to a scale constraint such as a 1 a = 1. Lagrange multi- 
pliers can be used to show that at the maximum, a is 
the eigenvector of Sz corresponding to the largest eigen- 
value. Subsequent principal components are denned by 
eigenvectors corresponding to subsequent eigenvalues. 



A similar argument shows that the a which maximises 
the ratio 

a l S^a 
satisfies 

SzCL — XSj\[CL 

This is a generalised eigenproblem (see [11] for example). 

For both PCA and MNF the uses of all mass charge 
ratios would produce sample covariance matrices that are 
extremely large, so firstly some pre-filtering is used. In 
PCA, this is often a peak identification method, or selec- 
tion by taking all the mass charge ratios for which the 
intensity exceeds some threshold (in some or all spots). 
For the MNF transform, we use only those mass charge 
ratios whose SNR values exceed a threshold. This SNR 
corresponds to the ratio of diagonal entries in Sz and 
Sn- The threshold is chosen so that the matrices are of a 
manageable size. 

Our implementation uses the LAPACK [12] routines for 
generalised eigenproblem interfaced to the R system for 
statistical programming [13]. All aspects of this process 
are automated in our code. The only manual interventions 
required have to do with pre-filtering of the signals and 
the choice of the number of bands for subsequent anal- 
ysis. Both of these manual interventions are required by 
PCA also. 

Results 

We demonstrate the method using a section of 10 /xm 
coronal murine midbrain. The section was desiccated 
for 30 minutes followed by washing in 70% and 100% 
EtOH for 30 seconds each, and subsequently desiccated 
until use. 20 mg/mL of 2,5-dihydroxybenzoic acid in 50% 
MeOH and 0.2% TFA matrix was deposited using 3 phases 
on an ImagePrep station. Mass spectrometry analysis 




Figure 2 The first six principal component images of a coronal murine midbrain section. 
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Figure 3 The first six MNF transformed images of a coronal murine midbrain section. 



was carried out on an UltraFlex III MALDI-TOF/TOF 
machine in linear positive ion mode. ClinProT calibrants 
(1:20 dilution with matrix) were used to calibrate the 
instrument. Data acquisition used flexControl V3.3, with 
300 shots taken at each spot and summed. Mass spectra 
were acquired in m/z range 1000-26000 at a rate of 0.1 
GS/s. Figure 1 shows the image of the section (A) and flex- 
Imaging (Bruker Daltonics) generated ion intensity maps 
for three m/z ratios (B-D). 

The processed data consists of intensities at 11280 mass 
charge ratios, repeated across a grid of 2012 spots over 
the tissue slice. The data were first logged and then back- 
ground corrected by using a 5-knot robust spline fit to 
estimate baseline. Pre-filtering of mass charge ratios was 
carried by thresholding intensities (in the case of PCA) 
or SNRs (in the case of MNF) so that 650 were retained. 
PCA and MNF transforms were computed. This means 
that PCA operated on the 650 mass charge ratios with the 
highest intensity, whereas MNF used the 650 mass charge 



ratios with the highest estimated signal to noise. The 
choice of 650 data points stems from trial and error and a 
pragmatic desire to use manageable covariance matrices. 

Figure 2 shows the first six principal component images 
of the data, there appears to be three images with signif- 
icant spatial structure, and the remaining three appear to 
be noise. 

Figure 3 shows the first six MNF bands. There are four 
images with clear spatial structure, so the MNF transform 
has been able to extract further information. 

Subsequent Analysis 

Deininger et al. [3] show the use of the principal compo- 
nents in hierarchical clustering, and this can also be done 
with the MNF bands. Hierarchical clustering is useful for 
identifying regions of the tissue with relatively homoge- 
nous properties. Using the PCA or MNF bands signifi- 
cantly reduces the computational complexity of clustering 
without overly reducing its usefulness. 
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Taking the first four MNF band images, and applying 
hierarchical clustering we can determine seven clusters. 
Figure 4 shows the results. Panel A shows the spots 
coloured according to cluster and panel B shows the 
average mass spectrum for the spots in each cluster. 

As with PCA, the choice of the number of MNF bands 
to use in subsequent analysis (such as hierarchical clus- 
tering) is somewhat ad-hoc and depends on the form of 
such analysis. For clustering and classification there are 
many methods for choosing the number of features but we 
regard this as a topic for further research. 

In this instance the number of components chosen (six) 
was primarily chosen for convenience and subjective rea- 
sons. The 5th and 6th PCA plots still show some faint 
internal structure, whereas subsequent ones do not (not 
shown). So we use 6 components for both PCA and MNF 
for consistency. 

More generally the number of PCA components can 
be chosen using percent total variation explained argu- 
ments. In this approach, the sum of the eigenvalues for 
the chosen components divided by the sum of all the 
eigenvalues, converted to a percentage, is considered. A 
threshold percent (eg. 80 or 90%) is then chosen and the 
number of principal components fixed at that which first 
exceeds the threshold. It is not so easy to apply this tech- 
nique for MNF as the eigenvalues represent signal to noise 
ratios and as such are not additive. However, since they are 
signal to noise ratios, they are scale-free and can be sub- 
ject to thresholds themselves ie. take all components with 
eigenvalue (signal-to-noise ratio) greater than a threshold. 
Examples of such a threshold might be one, ie. signal and 
noise are approximately equal. 

Conclusion 

We have shown that the minimum noise fraction trans- 
form is a potent addition to the suite of analysis tools 
available for the analysis of Imaging Mass Spectrometry 
data. Like PCA, we have further demonstrated that the 
MNF bands generated can be used as summaries of the 
mass spectra to analyse the spatial characteristics of a tis- 
sue slice. We regard the MNF transform as providing a 
useful alternative to PCA in Imaging Mass Spectrometry. 
Its denning feature is that is uses estimates of spatial signal 
to noise ratio to sequentially define bands whereas PCA 
uses only total variation (signal plus noise). 

Both PCA and MNF are computationally efficient when 
compared to the data acquisiton and preprocessing steps 
involved. In our implementation, all code was written in 
R and C and is therefore platform independent. However, 
the flexlmaging provided data in a proprietary format 
that required the use of a Windows only proprietary tool 
(CompassXport). We have successfully used emulation 
software on Linux and Mac OS X based systems to run 
this tool. 



Availability and requirements 

Project Name: Computing Minimum Noise Fraction 
Transforms of Imaging Mass Spectrometry Data; 
Project Home: http://staff.scm.uws.edu.aU/~glenn/# 
Software- 
Operating Systems: MNF code is in R and C and is com- 
patible with Windows, Mac, and Linux; 
Programming Language: R, http://cran.r-project.org 
and C; 

Other Requirements: caMassClass,[14]; Data Acquistion 
and conversion software (flexImaging/CompassXport); 
License GPL-2; 

Restrictions to use by non-academics: none; 
Availability of supporting data 

The software and supporting data are available for down- 
load from the project home at http://staff.scm.uws.edu. 
au/~glenn/#Software. 
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